| Name | Version | Summary | date |
| ragctl |
0.1.3 |
RAG Studio - Production-ready RAG toolkit with advanced OCR, semantic chunking, and intelligent document processing |
2025-10-30 10:41:20 |
| pptagent |
0.2.14 |
PPTAgent, a tool for utilizing LLMs to generate PowerPoint presentations from documents. |
2025-10-29 09:19:23 |
| blocknote-py |
0.3.1 |
🚀 BlockNote Python library - Convert BlockNote.js blocks to HTML, Markdown, PDF & JSON. Type-safe Pydantic models for Django, FastAPI, Flask backends. Rich text editor content processing made easy. |
2025-10-28 20:53:22 |
| xml-analysis-framework |
2.0.0 |
XML document analysis and preprocessing framework designed for AI/ML data pipelines - part of the unified analysis framework suite |
2025-10-28 01:49:57 |
| eless |
1.0.3 |
Evolving Low-resource Embedding and Storage System - A resilient RAG data processing pipeline with comprehensive logging, multi-database support, and CLI interface. |
2025-10-27 10:50:19 |
| parze |
0.1.1 |
Python SDK for the Parze API |
2025-10-23 12:23:30 |
| flockparser |
1.0.5 |
Distributed document RAG system with intelligent GPU/CPU orchestration |
2025-10-21 19:53:44 |
| aimq |
0.1.2 |
A robust message queue processor for Supabase pgmq with AI-powered document processing capabilities |
2025-10-21 11:51:49 |
| docx-mcp |
0.1.7 |
DOCX MCP处理器 - 完整的Word文档处理工具,支持图片编辑和表格操作 |
2025-10-19 04:03:47 |
| quanta-pdf |
1.0.3 |
Advanced PDF layout analysis engine for extracting figures, tables, and structured content |
2025-10-18 04:08:33 |
| docstrange |
1.1.7 |
Extract and Convert PDF, Word, PowerPoint, Excel, images, URLs into multiple formats (Markdown, JSON, CSV, HTML) with intelligent content extraction and advanced OCR. |
2025-10-14 11:34:19 |
| kreuzberg |
3.20.2 |
Document intelligence framework for Python - Extract text, metadata, and structured data from diverse file formats |
2025-10-11 18:34:25 |
| chunking-strategy |
0.4.1 |
A comprehensive chunking library for text, documents, audio, video, and data streams (Linux and macOS only) |
2025-10-10 11:26:49 |
| markitdown-chunker |
0.1.0 |
Convert documents to markdown, chunk them intelligently, and export structured data |
2025-10-10 01:05:22 |
| ragger-python-sdk |
0.1.3 |
Python SDK for ragger.ai RAG API |
2025-10-09 20:47:03 |
| ShrutiAI |
1.0.1 |
Python SDK for interacting with the shrutiAI API - your AI-powered assistant |
2025-09-18 11:38:01 |
| pdf2markdown |
0.3.0 |
Python library and CLI tool that leverages LLMs to convert technical PDF documents to well-structured Markdown |
2025-09-14 02:02:58 |
| qdrant-loader |
0.7.3 |
A tool for collecting and vectorizing technical content from multiple sources and storing it in a QDrant vector database. |
2025-09-11 07:33:39 |
| bank-statement-separator |
0.3.0 |
AI-powered tool for separating multi-statement PDF files using LangChain and LangGraph |
2025-09-10 14:48:39 |
| docling-onnx-models |
0.1.3 |
ONNX Runtime implementations for Docling AI models |
2025-09-09 08:45:47 |